

MJM Do. No. 5038-118 

MASS STOlWiE CACHING PROCESSES 
FOR POWER REDUCTION 



5 BACKGROUND 

1. Field 

This disclosure relates to storage caching processes for power reduction, more 
particularly to caches used in mobile platforms. 

2. Background 

10 Mobile computing applications have become prevalent. Some of the tools 

used for these applications, such as notebook or laptop computers have a hard disk. 
Accessing the hard disk typically requires spinning the disk, which consumes a 
considerable amount of power. Operations such as reading, writing and seeking 
consume more power than just spinning the disk. 

15 One possible approach is to spin down the disk aggressively, where the disk is 

stopped after short periods of time elapse during which no operations are performed. 
However, accessing the disk in this approach requires that the disk be spun back up 
prior to accessing it. This introduces time latency in system performance. 

Conventional approaches tune the mobile systems for performance, not for 

20 power consumption. For example, most approaches write back to the hard disk, 

writing "through" any storage cache. Usually, this is because the cache is volatile and 
loses its data upon loss of power. In many mobile operations, there is a concern 
about loss of data. 

Another performance tuning approach is to prefetch large amounts of data 
25 from the hard disk to the cache, attempting to predict what data the user wants to 
access most frequently. This requires the disk to spin and may actually result in 
storing data in the cache that may not be used. Similarly, many performance 
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techniques avoid caching sequential streams as are common in multimedia 
applications. The sequential streams can pollute the cache, taking up large amounts of 
space but providing little performance value. 

Examples of these approaches can be found in US Patent Nos. 4,430,712, 
5 issued February 2, 1984; 4,468,730, issued August 28, 1984; 4,503,501, issued March 
5, 1985; and 4,536,836, issued August 20, 1985. However, none of these approaches 
take into account power saving issues. 



10 to the drawings, wherein: 

Figure 1 shows one example of a platform having a non- volatile cache 
memory system, in accordance with the invention. 

Figure 2 shows a flowchart of one embodiment of a process for satisfying 
memory operation requests, in accordance with the invention. 
15 Figure 3 shows a flowchart of one embodiment of a process for satisfying a 

read request memory operation, in accordance with the invention. 

Figure 4 shows a flowchart of one embodiment of a process for satisfying a 
write request memory operation, in accordance with the invention. 



Figure 1 shows a platform having a memory system with a non- volatile cache. 
The platform 10 may be any type of device that utilizes some form of permanent 
storage, such a hard, or fixed, disk memory. Generally, these permanent memories 
are slow relative to the memory technologies used for cache memories. Therefore, the 
25 cache memory is used to speed up the system and improve performance, and the 
slower permanent memory provides persistent storage. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



The invention may be best understood by reading the disclosure with reference 
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DETAILED DESCRIPTION OF THE EMBODIMENTS 




The cache memory 14 may be volatile, meaning that it is erased any time 
power is lost, or non- volatile, which stores the data regardless of the power state. 
Non- volatile memory provides continuous data storage, but is generally expensive and 
may not be large enough to provide sufficient performance gains to justify the cost. In 
5 some applications, non- volatile memory may constitute volatile memory with a 
battery backup, preventing loss of data upon loss of system power. 

A new type of non- volatile memory that is relatively inexpensive to 
manufacture is polymer ferroelectric memory. Generally, these memories comprise 
layers of polymer material having ferroelectric properties sandwiched between layers 
10 of electrodes. These memories can be manufactured of a sufficient size to perform as 
a large, mass storage cache. 

Known caching approaches are tuned to provide the highest performance to 
the platform. However, with the use of a non- volatile cache, these approaches can be 
altered to provide both good performance and power management for mobile 
15 platforms. Spinning a hard disk consumes a lot of power, and accessing the disk for 
seek, read and write operations consumes even more. Mobile platforms typically use 
a battery with a finite amount of power available, so the more power consumed 
spinning the disk unnecessarily, the less useful time the user has with the platform 
before requiring a recharge. As mentioned previously, allowing the disk to spin down 
20 introduces time latencies into memory accesses, as the disk has to spin back up before 
it can be accessed. The non- volatile memory allows the storage controller 16 to have 
more options in dealing with memory requests, as well as providing significant 
opportunities to eliminate power consumption in the system. 

Other types of systems may use other types of main memories other than hard 
25 disks. Other types of systems may include, but are not limited to, a personal 

computer, a server, a workstation, a router, a switch, a network appliance, a handheld 
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computer, an instant messaging device, a pager, a mobile telephone, among many 
others. There may be memories that have moving parts other than hard disks. 
Similarly, the non-volatile memory may be of many different types. The main system 
memory, analogous to a hard disk, will be referred to as the storage device here, and 
5 the non- volatile cache memory will be referred to as such. However, for ease of 
discussion, the storage device may be referred to as a hard disk, with no intention of 
limiting application of the invention in any way. 

The storage controller 1 6 may be driver code running on a central processing 
unit for the platform being embodied mostly in software, a dedicated hardware 

10 controller such as a digital signal processor or application specific integrated circuit, 
or a host processor or controller used elsewhere in the system having the capacity for 
controlling the memory operations. The controller will be coupled to the non- volatile 
cache memory to handle input-output requests for the memory system. One 
embodiment of method to handle memory requests is shown in Figure 2. 

15 A memory request is received at 20. The memory request may be a read 

request or a write request, as will be discussed with regard to Figures 3 and 4. The 
memory controller will initially determine if the cache 22 can satisfy the request. 
Note that the term 'satisfied' has different connotations with regard to read requests 
than it does for write requests. If the cache can satisfy the request at 22, the request is 

20 satisfied at 24 and the memory controller returns to wait for another memory request 



If the cache cannot satisfy the request at 22, the storage device is accessed at 
26. For hard disks, this will involve spinning up the disk to make it accessible. The 
disk memory operation is then performed at 28. Finally, any queued memory 
25 operations will also be performed at 30. Queued memory operations may typically 
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include writes to the disk and prefetch read operations from the disk as will be 
discussed in more detail later. 

Having seen a general process for performing memory operations using the 
memory system of Figure 1, it is now useful to turn to a more detailed description of 
some of the individual processes shown in Figure 2. Typically, write requests will 
remain within the process of satisfying the request from cache, as the nature of 
satisfying the request from cache is different for write operations than it is for read 
operations. Write operations may also be referred to as first access requests and read 
operations may be referred to as second access requests. 

Figure 3 shows an example of a read operation in accordance with the 
invention. The process enclosed in the dotted lines corresponds to the disk memory 
operation 28 from Figure 2. At this point in the process, the read request cannot be 
satisfied in the cache memory. Therefore, it is necessary to access the disk memory. 
A new cache line in the cache memory is allocated at 32 and the data is read from the 
disk memory to that cache line at 34. The read request is also satisfied at 34. This 
situation, where a read request could not be satisfied from the cache, will be referred 
to as a 'read miss.' Generally, this is the only type of request that will cause the disk 
to be accessed. Any other type of memory operation with either be satisfied from the 
cache or queued up until a read miss occurs. Since a read miss requires the hard disk 
to be accessed, that access cycle will also be used to coordinate transfers between the 
disk memory and the cache memory for the queued up memory operations. 

One situation that may occur is a read request for part of a sequential stream. 
As mentioned previously, sequential streams are generally not prefetched by current 
prefetching processes. These prefetching processes attempt to proactively determine 
what data the user will desire to access and prefetch it, to provide better performance. 
However, prefetching large chunks of sequential streams does not provide a 
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proportional performance gain, so generally current processes do not perform 
prefetches of sequential data streams. 

Power saving techniques, however, desire to prefetch large chunks of data to 
avoid accessing the disk and thus consuming large amounts of power. The method of 
5 Figure 3 checks to determine if the new data read into the cache from the disk is part 
of a sequential stream at 36. Generally, these sequential streams are part of a 
multimedia streaming application, such as music or video. If the data is part of a 
sequential stream, the cache lines are deallocated in the cache from the last prefetch at 
38, meaning that the data in those lines is deleted, and new cache lines are prefetched 

10 at 40. The new cache lines are actually fetched, a prefetch means that the data is 
moved into the cache without a direct request from the memory controller. 

If the data is not from a sequential stream, the controller determines whether or 
not a prefetch is desirable for other reasons at 42. If the prefetch is desirable, a 
prefetch is performed at 40. Note that prefetches of sequential streams will more than 

15 likely occur coincident with the disk memory operations. However, in some cases, 
including some of those prefetches performed on non-sequential streams, the prefetch 
may just be identified and queued up as a queued up memory operations for the next 
disk access, or at the end of the current queue to be performed after the other queued 
up memory operations occur at 30 in Figure 2. 

20 In summary, a read operation may be satisfied out of the cache in that the data 

requested may already reside in the cache. If the request cannot be satisfied out of the 
cache, a disk memory operation is required. In contrast, a write request will be 
determined to be satisfied out of the cache. Because the cache is large and non- 
volatile, write requests will typically be performed local to the cache and memory 

25 operations will be queued up to synchronize data between the cache and the disk. One 
embodiment of a process for a write request is shown in Figure 4. 
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Referring back to Figure 2 and replicated in Figure 4, the general process 



determines if the current request can be satisfied in the cache. For most write 



requests, the answer will be deemed to be yes. The processes contained in the dotted 



box of Figure 4 correspond to the process of satisfying the request from cache at 24 in 
5 Figure 2. At 50, the memory controller determines whether or not there are already 
lines allocated to the write request. This generally occurs when a write is done 
periodically for a particular application. For example, a write request may be 
generated periodically for a word processing application to update the text of a 
document. Usually, after the first write request for that application occurs, those lines 

10 are allocated to that particular write request. The data for the write request may 
change, but the same line or line set in the cache is allocated to that request. 

If one or more lines are allocated to that write request at 50, the allocated ache 
line or lines are overwritten with the new data at 58. If the cache has no lines 
allocated to that request, new lines are allocated in 52 and the data is written into the 

15 allocated lines at 54. Generally, this 'new' memory request will not have any 

counterpart data in the disk memory. A disk memory operation to synchronize this 
newly allocated and written data is then queued up at 56 to be performed when the 
next disk access occurs. It might also be deferred beyond the next time the disk is- - 
spun up. Since the memory is non-volatile, the disk does not need to be updated soon. 

20 These queued up memory operations may include the new cache writes, as just 

discussed, as well as prefetches of data, as discussed previously. Periodically, the 
memory controller may review the queue of memory operations to eliminate those 
that are either unnecessary or that have become unnecessary. 

Several write requests may be queued up for the same write request, each with 

25 different data, for example. Using the example given above, the document may have 
made periodic backups in case of system failure. The memory controller does not 
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need to perform the older ones of these requests, as it would essentially be writing the 
data to almost immediately write over it with new data. The redundant entries may 
then be removed from the queue. 

A similar culling of the queue may occur with regard to read operations. A 
5 prefetch previously thought to be desirable may become unnecessary or undesirable 
due to a change in what the user is currently doing with the platform. For example, a 
prefetch of another large chunk of a sequential data stream may be in the queue based 
upon the user's behavior of watching a digital video file. If the user closes the 
application that is accessing that file, the prefetches of the sequential stream for that 

1 0 file become unnecessary. 

In this manner, only read misses will cause the disk to be accessed. All other 
memory operations can be satisfied out of the cache and, if necessary, queued up to 
synchronize between the cache and the disk on the next disk access. This eliminates 
the power consumption associated with disk access, whether it be by spinning the 

15 disk, as is done currently, or both other means which may become available in the 
future. 

Since the write operations or second memory access requests may be satisfied 
by writing to the cache, they may be serviced or satisfied first. Read operations may 
require accessing the storage device, and therefore may be serviced after the second 

20 access request. 

In the case of a rotating storage device such as a hard drive, most of these 
operations will either begin or end with the storage device being spun down. One 
result of application of the invention is power saving, and spinning a rotating storage 
device consumes a large amount of the available power. Therefore, after a memory 

25 access request occurs that requires the hard disk to be spun up, the hard disk will more 
than likely be spun down in an aggressive manner to maximize power conservation. 
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Thus, although there has been described to this point a particular embodiment 
for a method and apparatus for mass storage caching with low power consumption, it 
is not intended that such specific references be considered as limitations upon the 
scope of this invention except in-so-far as set forth in the following claims. 
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